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1.  A  general  remark  concerning  randomized  tests  and  confidence 
ranges. 

Let  there  be  a  class  of  distributions  P#  which  depend  on  a 
parameter  $  .  Let  X  be  the  appropriate  random  variable,  and  *  its 
realizations.  Further,  let  there  be  a  definition  of  a  family  of  null 
hypotheses  H0  (6):  Be  o0  (6)  -  Each  null  hypothesis  is 

assumed  to  have  a  specific  alternate  hypothesis  (£):  fiteo.  cV  • 

The  problem  of  constructing  a  test  for  a  specific  null  hypothesis 
Hq  (S0)  ,  which  in  a  certain  sense  is  optimal  for  the  appropriate 

alternative  hypothesis  /7<  (£p)  ,  with  due  consideration  for  a  prescribed 
safety  factor  oC  ,  frequently  leads  to  a  "criterion"  )  , 

on  the  basis  of  which  decisions  about  acceptance  or  rejection  of  the 
n<’ll  hypothesis  are  made  according  to  the  following  method: 

He  acc®Pt,«d»  if  (*  i  ^o)  ^  c 

V 0  (i0)  is  rejected  with  probability  p  ,  if  1C  (x,  )  *  c 

H p  is  always  rejected,  if  C 

The  constants  C  and  p  must  be  chosen  so  that  the  prescribed 
safety  factor  <x>  is  maintained. 

Let  us  assume  that  the  distribution  of  K.(X ,  ^  )  upon  validity 
of  Hq  depends  only  on  <T0  (but  not  on  f?f  )•  We 

now  define  the  following  quantity: 

6.r)=  Pr(%(x,^)<  '<(*•*))+  r  %(<(*,*)•  ^ 


The  function  permits  construction  of  tests  and  confidence 

ranges  in  a  very  simple  manner: 


In  order  to  bring  clarity  into  the  following  considerations,  we 
shall  dispense  with  parameter  f  ,  which  at  this  moment  is  irrelevant, 
and  write: 

S(x,r)*  P(>C(X)<  £(*))  +  '  &(*))' 

We  now  define  a  auantity  for  each  numb*  r  ot€  * )  by: 

^  {c;  PC*  >)<  c)>ot  j 

Let  R.  be  a  random  variable  equally  distributed  in  {_0,  !_}  which 
is  independent  of  X  .  In  this  case: 

1.  p‘ *(s  Ck,  £)<«.)  =  /  ,  if  1C  00  < 

Proof:  For  all  r£  ffl,  0  ,  P£K  R.  C*))<  OC 

is  valid,  if  ^ 

*■  *<*<*,**  *>*  -ir  t6)*e- 

and  P(lc(x)  *  Q  _ 

Proof:  f>*(P(K:() ()<  c^)  +  RP  (<(*)  =  Cot)<  ot) - 

^  oc-  P(*  (X)<^) 


_  ftft  /  /J  —  -  s.- Cqi)  V  -  '  o»-- 

P  V  PC*C^*V>  / 

since  P^(x)<  cg)|  *<-5  cet)  i8  because 

of  the  left-sided  continuity  of  P(X.(x)<  and  the  right- 
aided  continuity  of  P C^O*)  ^  • 

3.  P*C*C*,*)<°0*  0  ,  if  KC*)>CoL 

Proof:  For  all  r#  C<3,  q  ,  valid, 

if  K(-)>cu 

We  now  demonstrate  that  S  CX  R)  is  equally  distributed  in  T 0.  Q 
P**CS(X,R)^ 

Proof !  p*,  R  ftj  <  «c)  -  £  *  (X ,  P)  C  otj  « 

-« .  Pffc/V)-  cj  -  eL  . 
PCfW-^) 


It,  follows  from  the  preceding  that  the  tost  method  described  above 
nay  also  be  formulated  as  follows:  A  realization  (*,f)  is  determined 
and  the  null  hypothesis  is  rejected,  if  £(*,<)>  oc  ,  otherwise  it  is 
accepted.  This  method  guarantees  that  the  null  hypothesis  is  accepted 
( provided  the  prescribed  safety  factor  o<  Is  observed),  if  f?(x)< 

that  it  is  rejected  with  probability  oC  -  P  (K(*)< 

if  K(‘)~  CoL  ,  and  is  always  rejected,  if 

The  function  (*t  0  also  facilitates  description  of  the 
confidence  range  which  belongs  to  the  test  family  discussed  above: 

The  quantity  of  that  &  ,  for  which  S$(* ,r)  ^  «*-  »  contains  the  correct 

parameter  value  d  with  probability  oc  and  thus  yields  a  confidence 
ra;.hc  with  safety  factor  . 

If  Sf  (*,r)  is  a  monotonic  function  of  f  ,  the  confidence 
range  is  even  more  easily  defined:  One  simply  determines  the  value  of 

(*•  r)  ^  i 

The  confidence  range  therefore  consists  of  all  values  £>  (if  we 
assume  that  S  is  a  monotonic  decreasing  function  of  f  ). 

In  this  case  we  nay  further  use  function  £  to  obtain  a  median 
unbiased  estimate  function  for  S  •  This  function  is  given  by  S', 

(9,  page  83).  U 


These  introductory  remarks  are  designed  to  simplify  notation  in 
subsequent  chapters.  We  shall  generally  limit  ourselves  to  the  "criterion" 
K  (*,  £ )  applicable  to  each  problem.  Tests,  confidence  ranges  and  median 
unbiased  estimate  functions  are  derived  from  this  criterion  mechanically 
with  the  aid  of  function  ;  for  this  reason  the  procedure 


Since  there  are  certain  practical  reservations  against  randomized 
metnods,  the  author  felt  Justified  in  working  with  the  (non-randomized) 

function  5^  (*)  a  (*  •  2)  .  In  repeated  application  to  various 

tc3t  ;  -uieras  this  method  will  surely  produce  the  safety  factor  «*.  on 
the  average.  At  any  rate,  employment  of  function  would  be 

preferable  —  even  from  the  practical  point  of  view  —  to  the  customary 
method  which  corresponds  to  the  use  of  function  and  therefore 

yields  safety  factors  of  excessive  size  systematically  (and,  consequently, 
confidence  intervals  that  are  too  wide  systematically). 

Next,  a  case  of  practical  importance  will  be  stressed,  in  which 
C*'r.>  is  a  monotonic  decreasing  function  ef  £  • 
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Lftoma  1:  If  the  random  variable  Yr  ^  f*.®)  contains  a  nxmotonic 
increasing  density  quotient,  and  if  K(*,6)  is  a  monotonic  decreasing 
(or  constant)  function  of  <T  ,  than  Sr  fx  r)  is  a  monotonic  decreasing 
function  of  J  .  " 

Proof:  It  i 3  known  (9,  page  74)  that  ^(y) is  a  monotonic  decreasing 
function  of  S  for  every  monotonic  decreasing  function  associated  with 
the  aonotonic  increasing  density  quotient  Eg  *  Therefore, 

j  I  is  valid  for  y  <■  K  (*,&) 

'  |  f  1b  valid  for  y  * 

/  0  ia  valid  for  y  |c  Qk,&) 

Egt  is  valid  for  S<L  &'  . 

However,  in  view  of  ^CY'^  ) 

is  also  valid  for  (T  ^  ,  therefore: 

% ■ 'f' f r, a; >  es,  't'(Y,s‘) 

tnd:  Ej  y, ,  ffT,:*)  Is  valid  for  £  <  &  . 

However,  since  Eg  is  valid  for  function  w; 

defined  above,  the  lean  is  proved, 

2.  Exponential  distributions. 


Lst  us  consider  a  class  of  distribution  functions  whose  density 
with  respect  to  an  O'  -finite  measure  fx.  defined  in  Euclidean  space 
is  represented  by: 


./s  o)  .  r  (q\  r  6*~l 

f  v-t  /  v  v  i  - 


where  fr(*,0)4/*'C*)=  l 


is  valid  for  /  f  O 


The  atudy  of  such  exponential  distributions  is  of  particular  Interest 
because  an  arbitrary  class  of  distributions  having  a  function  t,  C *)  , 

which  is  exhaustive  with  respect  to  its  parameter,  may  always  be  reduced 


to  the  form: 


p(*.  &)  *  C  C0to)J  h  0) 


by  appropriate  transformation  of  the  parameter,  provided  a  certain 
regularity  exists  and  the  range  in  which  the  densities  are  positive  is 
independent  of  Q  (8).  Since  "t  C *)  i»  exhaustive  with  respect  to  6  f 

we  may  restrict  ourselves  to  the  distribution  of  "L  .  The  latter  has 
density  C(6)t*p[0t]  with  respect  to  an  appropriate  measure  /*■  C^)  > 
which  leads  the  general  problem  back  to  fora  (l). 
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Thus  the  results  obtained  from  a  3tudy  of  distributions  (l)  are 
applicable  to  a  large  class  of  distributions  with  exhaustive  functions. 
Wc  shall  subsequently  discuss  th^sir  applicitioi  to  certain  discrete 
distributions . 

3.  Criteria  for  parameter  0  . 


Theorem  1:  The  criterion 

K  « 

defines  a  most  powerful  test  for  the  null  hypothesis  B  z  Bq  with  respect 
to  all  altemati'e  hypotheses  6>  8.  . 

„  0  rO,e) 

Proof:  For  v  >  U*.  ,  the  quotient  of  densities  — a  \ 

is  a  monotonic  increasing  function  of  *  .  For  this  reason,  the  above 

criterion  leads,  according  to  Theorem  3.2  in  Lehmann  (9,  page  68)  to  a 
uniformly  most  powerful  test  in  the  range  0 >■  0^  . 

We  shall  now  considsr  the  null  hypothesis  %-B‘9e  and  the 
alternative  hypotheses  H('  0*0$  “  &  and  H ^  G*  G0  ■+  ^  .  There 

is  no  test  that  would  be  most  powerful  with  respect  to  both  alternative 
hypotheses.  We  must,  therefore,  resort  to  another  concept  of  optimality. 
Assuming  that  each  of  the  two  alternative  hypotheses  H ,  and  hx  has 
the  same  probability  if  is  not  valid,  and  that  upon  rejection  of  ne 

a  classification  (i.e.,  a  decision  for  or  )  must  taka  place, 

we  may  use  the  optimal  test  and  classification  method  developed  in 
(11,  pa.3e  15).  The  method  is  based  on  the  following  criterion: 

^  [rfr'Qo-A)  pC'.Qo+A)  “)  _ 

(2) 

On  the  basis  of  this  criterion  we  must  define  a  function  SC”*r) 


On  the  basis  of  this  criterion  we  must  define  a  function 


and  reject  Mg  if  {x.r)  ^  ot  .  The  decision  is  then  made  in 

favor  of  W  or  Hx  ,  whichever  is  associated  with  a  larger  quotient 
of  density  functions.  (If  both  quotients  are  identical,  the  decision 
is  made  arbitrarily). 


A  test  and  classification  method  constructed  on  the  criterion 
defined  by  (2)  guarantees  that  the  probability  of  a  correct  classification 
is  maximal  in  the  event  H  is  invalid,  provided  the  prescribed  safety 
factor  Is  observed  (11,  page  14,15). 


S 


Criterion  (2)  also  depends  on  quantity  ^  .  If  our  interest  is 

focused  principally  on  alternative  hypotheses  near  the  null  hypothesis, 
tie  nay  employ  a  locally  optimal  test  and  classification  method.  We 
obtain  the  latter  by  using  criterion 


Since 

(3) 


t'C») 

c(o) 


Y  '  f  M  ..  3  k  (*  • 

K  M)-  —j^r 


-  -  (*) 


A  -  0 


9K  (t,B) 
3A 


is  valid.  This  proves: 
Theorem  2:  The  criterion 


*  M«  [-«♦£,  00, 


0-0 


K' (>.%)*  I«-^.  C*)l 

0 


defines  a  locally  optimal  test  and  classification  msthod  for  null 
hypothesis  $  *■  at  point  ^*0  Kith  respect  to  alternative 

hypotheses  H, : 0~ 6^  -  A  and  0*  9q  f  A.  » 

Generally  speaking,  the  two-sided  problem  described  here  does  not 
require  delineation  of  tbs  critical  regie n  in  such  a  manner  tnat  both 
parts  of  the  critical  region  have  the  sane  probability,  but  demands  that 
they  be  symmetrical  about  the  expected  value.  We  may,  of  course,  find 
a  (bilaterally  bounded)  confidence  interval  for  0  on  the  basis  of 
this  criterion. 


Another  possibility  of  obtaining  a  test  which  is,  in  a  certain  sense, 
optimal  for  hypothesis  H q  with  respect  to  alternatives  and  , 

is  offered  by  limitation  to  universally  unbiased  tests.  The  class  of 
unbiased  tests  contains  a  test  which  is  most  powerful  with  respect  to 
both  alternatives.  This  test  still  depends  on  ^  .  Here,  again,  a 

locally  most  powerful  test  may  be  constructed  when  alternatives  close  to 
the  null  hypothesis  are  involved.  The  test  obtained  in  this  manner  is 
found  in  Lehmann  (9,  page  126,  127).  It  is  not  identical  with  the 
criterion  formulated  in  Theorem  2. 
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4.  The  2-sample  problem:  Criteria  for  the  difference  of  tro 
parameters. 


Let  there  be  two  distributions  with  densities 

?L  OS  =  (&i)  t/p  C&£*],  l  ~  1,2 


with  reference  to  measures  or 

by  two  new  parameters  9  and  <f  : 


(4) 


9t  --  9  *cT 
9% 


We  now  replace  &t  and  9X 


era  o 

and  search  for  a  test  for  the  (combined)  null  hypothesis  °  ~  °o >  (  ' 

i.e.,  we  look  for  a  test  for  the  difference  of  6,  and  without 

making  assumptions  of  any  kind  about  the  magnitude  of  either  parameter. 

The  combined  density  of  ^X» '  with  respect  to  measure  /<--/*■  % 

13  p ' xj2 ' 9  £)-  c(C*+s) ciC*)c->if>L&c*i+  *x)+  SxtJ  . 

We  now  consider  the  conditional  distribution  of  I  "r^2  ~  *  ' 

According  to  Lemma  2.8  in  Lehmann  (9»  page  52),  there  is  a  measure 
Vjjfxj)  which  is  such  that  the  conditional  distribution  of  X,  |  X,  ** 
has  the  following  density  with  respect  to  this  measure: 

(5)  ££»,! 

This  density  is  independent  of  0  (i.e.,  function  x  *  */  * 

is  exhaustive  with  regard  to  parameter  9  ) .  Furthermore,  the  same  lemma 
provides  that  the  distribution  of  X  (for  stabilised  S'  )  is  an  ex¬ 
ponential  distribution  with  the  parameter  $  ,  and  therefore,  according 
to  Theorem  4.1  in  Lehmann  (9»  page  132),  complete  for  every  value  of  a  , 
Consequently,  based  on  a  familiar  theorem  of  Lehmann  and  Scheffe 
(cf .  Theorem  4.2  in  9»  page  134)>  every  similar  test  for  the  above- 
mentioned  null  hypothesis  has  a  Neyman  structure,  i.e.,  it  is  a 
conditional  test  for  x  .  Thus  the  problem  of  finding  a 

similar  test  for  the  above-named  combined  null  hypothesis  is  reduced 
to  the  problem  of  finding  a  test  for  null  hypothesis  5**  0$  based  on 
density  (5).  This  proW.w»  was  solved  in  the  preceding  chapter.  We 
proved: 


7 


Theorem  3:  The  criterion 

K("r  V  "o')  ‘ 

defines  «  test  for  tl«.  "'■Ttfred  null  hypothesis  d  s 
which  is  nos t  power  Ail  similar  with  reference  to  all  alternative 
hypotheses  <T >  <5^,  D,  ,  if  we  base  our  construction  of  function 

*,**2  *  11  (For aula  5)* 


on  the  conditional  distribution  of  a, 


Similarly,  we  may  derive  the  corresponding  two-sided  problem  from 
Theorem  2: 

Again  we  start  with  representation  (4)  oi‘  parameters  6,  and  9 ' 
by  $  and  €  ,  and  obtain: 


Theorem  4:  The  criterion 

k'O,.  <%.i,)'h,-ei0cx,\‘)l 

defines  a  test  and  classification  method  .‘or  the  null  hypothesis 

^  *£0,  &  (  -fl-  which  is  local  optimal  similar  at  point  *-  0  with 
respect  to  alternative  hypotheses  ,  6  (  XI  and 

H  -  €  -  &0  *  &  ,  6  f  O-  ,  if  we  base  our  construction  of  function  S 
on  the  cond}  clonal  distribution  of  I  *  ^2  *  *  (Formula  5). 


5.  Some  versions  of  the  k-sample  problem. 

Let  there  be  fc  distributions  with  aensitins 

r i'  1,2,"  L 

with  respect  to  measures  /<  t  *  2  !  '  "  /^k  '  assume  that  the 

parameters  0f,  depend  on  two  parameters  9  and  in  the 

following  form: 


(6) 


9c  (£J  -  J *  A 1  (t ) ,  w<tli  &  =  0  l  *  1,2,  —  k. 


Theoras  5:  The  criterion 


£  0,’*x'"  *k'°) 9  £  Ai  0’)n 


defines  a  test  for  the  combined  null  hypothesis  ^  ■-  0w9e\l  which  is 
most  powerful  similar  with  respect  to  alternative  <Tt  £t,  -fi.  ,  if  we 
bLse  our  construction  of  Amotion  S  on  the  conditional  distribution 
of  2(  X  2  >  • "  t 

**  I  E  V  *  * 
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Corollary :  In  the  special  case  A-  (<f)-  ^  •  <T  we  finally  obtain 

rn  on  **  ^ 


the  criterion 


which  is  independent  of  (f.  ,  making  the  test  most  powerful  with  reference 
to  all  £  >  0  • 

Particular  interest  from  the  practical  point  of  view  is  due  in  two 
cases,  where  -ter  i  *  1, 2, ...  k  and  A;  *  I  t  «  I.  ...  /.  > 

A;  -  0  t.  7  ' 

The  proof  of  Theorem  5  is  obvious:  The  combined  density  of 

C\.X3l^Xk)  with  reference  to  measure^,  y  is: 

p(*r  *2'"'  \JJCi  (04&i 

We  now  consider  the  conditional  distribution  of  IE*r- 

i 

According  to  Lemma  2.8  in  Lehmann  (9,  page  52),  there  is  a  measure 
(ki'*2>  "  *,  )  which  is  such  that  ^he  conditional  distribution  of 

V  *JS*i  ** 

with  reference  to  this  measure  has  the  1 following  density: 


Cx  £S)«-*P  (0*i 


This  density  is  independent  of  ^  .  The  same  leana  provides  that 

the  distribution  of  *  (for  stabilized  <T  )  is  an  exponential  distribu¬ 
tion  with  the  parameter  6  and  therefore,  according  to  Theorem  4*1  in 
Lehmann  (9,  page  132),  complete.  For  this  reason  and  according  to  the 
above-mentioned  theorem  of  Lehmann  and  Scheffe  (Theorem  4.2  in  9,  page 
134),  every  similar  test  for  Ss.OtB(JTL  has  a  Neyman  structure,  i.e., 
it  is  a  conditional  test  for  X,4M.«+  x  .  Consequently,  if 
we  construct  the  quotient  of  densities  (7)  for  £  and  S' *-  0  , 

we  see  that  it  is  a  monotonic  function  of  k_  ’  , 

SA«)»i 

proving  Theorem  5.  i 

Let  us  consider  another  version  of  ths  k-sample  problem:  Let  there 
be  en  hypotheses  ^  .  Hypothesis  Wj  establishes  the 

value 8  A»ij  (S )  in  a  quite  specific  manner  (where  AijC1?)*  0  always 
applies  to  all  i,j  ).  We  shall  discuss  the  following  cases  in  detail: 
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a) 


fO 


\  0  {or  L  4J 


In  this  case  hypothesis  Hj  indicates  that  the 
—  and  only  this  one  —  has  a  parameter  value 
the  remaining  distributions. 


,th 

J  distribution 
which  deviates  from 


{ 


&  i  >  J 

0  ■£o>-  c  <  J 


A. 


In  tliis  case  hypothesis  indicates  that  a  Jaap  bad  occurred  at 

point  j  and  that  the  distributions J,j-t  k.  have  a  parameter 
which  aeviates  from  the  remaining  distributions. 


In  treating  the  problem  formulated  above,  we  assume  that  each  of 
the  alternative  hypotheses  W(i  has  t.  same  probability,  and 

that  a  decision  must  be  made  for  one  of  the  alternative  hypotheses,  if 
the  null  hypothesis  H  ■  0,  Of  Q.  is  invalid  and  must  be  rejected. 


rhtorem  6:  The  criterion 

K  (-,■  «i . k,  0)\*? 


defines  a  test  and  classification  method  for  the  combined  null  oypothesis 
CO,  6f  JCL  which  is  local  optimal  similar  at  point  0  wi  n  respect 

,  if  we  base  our  construction  of 


to  alternative  hypotheses  W,  H „ 

function  on  the  conditional  distribution  of 
(Formula  7) . 


X  ,  X, 


truction  of 


Corollary  a:  In  the  special  case  a,  we  get  the  criterion: 

(Xjlx)J 

J~  '  "  A  tor  all  f>  0  ) 

Incidentally,  this  criterion  is  uniformly  optim&l'in  case  all  measures 
M,  -/S'  /'a  are  identical,  and,  since 


is  further  simplified  to: 


/lex  j>j] 
J  *  I,  * *■ *  ^ 
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Corollary  b:  In  the  special  case  b,  we  get  as  criterion: 

<8)  ^r.u 

L  lv  ^ 

Since  the  proof  of  Theorem  5  wad  presented  in  detail,  proof  of 
Theorem  6  may  be  abbreviated.  As  above,  it  is  shown  that  every  similar 
test  of  the  null  hypothesis  must  be  a  conditional  test.  According  to 
(11,  page  15),  _  r, 


Ma,  r  . . .  1 


is  the  optimal  criterion  of  decision.  Here  Pj  (*  »  0  is  the 

specific  form  assumed  by  density  (7)  under  hypothesis  Hj  ,  therefore 

rj  (' . «fc.0-c»°V0  [H  csJ*cJ- 

P  (*,***  *k>^)  is  the  density  in  case  of  validity  of  the  null  hypothesis. 
In  order  to  obtain  a  local  optimal  decision  procedure  at  point  0  , 

we  merely  employ  the  criterion 


P'Ux  f  (y  ^  °)  1 

jel.wxWv  Lp  (x  ,%ks  n,  ,  o)  J 


J  '• .  L  P  <-V'"  J 

where  fj  ("V  "  is  the  derivative  of  *■ .  after 

<T  at  point  5*5  0 

Since  ^  _ 'T”*  >  (o)  )  *)/ 

CU?  (0)  -  Ls  y 

p  x^,  o;  "7" 

is  valid,  which  proves  our  theoran. 

Specialization  for  k  *  2  leads  us  once  again  to  the  2-sanple 
problem: 

Theorem  5  is  converted  to  Theorem  3,  since  the  quantity 
A(  (y,)t  iA^CS,)  is  a  monotonic  function  of  x^  for  xf+x2  *  x 


11 


Theorem  6  is  converted  to  Theorem  k,  if  we  make  the  obvious 
assumption  that  and  Atl  (&)  .  In  this  case  the 

criterion  of  Theorem  6  proves  equivalent  to  i,  _ c  A  I,)  I  because 

*(4  «.  *  .  i  i  wO  ^  /  i 

6.  The  k- sample  problem  upon  r  repetitions. 


Let  there  be  k  x  r  distributions  with  following  densities 
with  reference  to  certain  measures f,  V"  r )  ' 

k 


U  1 ,  *,-> 

J  •  r 


W«  Hiuat  that  parmMt.ro  vij  .r.  dapandent  on  certain  paraaatara 
(>r<)2,*.6T.S)  in  the  following  manner: 

m  wiiv.  a^o) -0 

We  are  faced  with  the  task  of  finding  a  similar  test  for  the 
combined  null  hypothesis  H0  i  $  •  Bj  (  XI  f  J  »  /,  2,  *»*  r  . 

In  practice,  such  a  problem  occurs  when  a  test  is  designed  to 
establish  whether  k  distributions  are  identical  and  the  test  is 
repeated  r  times,  although  it  is  impossible  to  arrange  repetition  * 
such  a  way  that  9j  has  the  same  value  for  all  repetitions.  The 


formulation  chosen  above  seems  adequate  when  the  situation  promises,  . 
upon  failure  of  the  null  hypothesis,  that  values  A,  (i),  C&. ) 

are  identical  in  all  repetitions,  e.g.,  that  an  increasing  trend  is 
prevent  in  all  repetitions,  or  that  ans  distribution  deviates  from  the 
others,  that  this  distribution  is  not  identified  from  the  start,  although 
it  is  clear  that  the  deviation  would  be  valid  for  all  repetitions  and 
would  always  involve  the  same  distribution. 


As  in  the  preceding  chapters,  ws  consider  that  each  similar  test 
must  be  a  test  with  Nsyman's  structure.  Ii  we  desolate  the  xeaLiza- 
tions  of  distribution  p;'  as  *£\  ,  and  *£|  +  »«.*+■  *  *£  t 

while  »ij  •*• ...+ '•jl}  »  x.  j  ^  *  we  may  fonmlate  the  following 

theorems :  * 


Theor« 


The  criterion 


WiAUOl  J.WU  M  -  f  f  \ 

defines  a  test  for  the  combined  null  hypothesis  £*■(),  r 

which  is  most  powerful  similar  with  respect  to  the  alternative 

,  Bj  €  H ,  J  =  1,2,  "v  f  ,  if  we  base  our  somatruction  of  function  -S 


on  tlie  conditional  distribution  of  X 


It 1 


X„  J*. 


I  *  V 


X-r  * 
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Corollary:  In  the  fecial  case  Aj(&)w  we  ultimately  obtain 


the  criterion 


^  '"'*kr  <°)  c  II  "t* 


which  is  independent  of  C.  ,  making  the  test  most  powerful  with  respect 
to  all  f  >  0 

The  proof  is  analogous  to  proof  of  Theorem  5* 

Theorem  8:  The  criterion 

. 

defines  a  test  and  classification  method  for  the  combined  null  hypothesis 
i'0f  f£l ,  r  which  is  local  optimal  similar  at  point  f  ~  0 

with  respect  to  alternative  hypothesis  ,  if  ws  base  our 

construction  of  function  S  on  the  conditional  distribution  of 

*"  ^kf  I  ^*| s  **1' \ 
(Hsre  hypothesis  Hf,  determines  the  values  (&)  with  *>  £),.$/• 

Cf.  page  10  for  this  and  special  cases  a  and  b) . 

Corollary  a:  In  the  special  caee  a  we  get  as  criterion: 


Incidentally,  this  criterion  is  uniformly  optimal  (for  all  )  in 

case  measui’ea^jj,/*.*  Jt,  j  for  constant  j  are  identical,  and  is 

mjjH  «  4n  V  J 

.  c-iO 

i. 


Corollary  b:  In  the  special  case  b  we  get  as  criterion: 

7 •  Some  theorems  pertaining  to  reproductive  distributions. 

Premise:  Let  there  be  a  class  of  distributions  which  depends 
on  two  parameters  6  and  A.  >  and  wnich  is  coiq>lete  for  every  fixed  A.  . 
Further,  let  this  class  be  reproductive  for  every  fixed  B  with  respect 
to  the  parameter  K  : 
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and  let  the  sum  variable  X“  Xj  •f  V  ^  be  exhaustive  for  all  values 
with  respect  w>  parameter  £  . 

Let  k)  be  independent  random  variables  with  distribu¬ 

tions  py,  .  The  sum  variable  X  in  that  case  has  the  distribution 

^h  **  tl  • 


2:  On  the  premise  defined  above, 

_  .  .  .  X.- 


efxj,), 


Proof:  Due  to  reproductivity. 


E(xd 

'V  %  /X*  \ 

For  this  r«aaODf  E*(E  (xL  |X ))  =  £<V£)  *  ^ E  (X)'  E \k  X)  ‘ 
ti»t i»,  4i_x  =  0' 


According  to  the  premise,  X  is  exhaustive  with  respect  to  B  ,  and 
£” ftL  |*)  ie  independent  of  $  .  Furthermore,  since  X  has  distribu¬ 

tion  T>  ^  and  the  Latter  is  complete  according  to  the  premise, 

A. 

6  j  m) - x  *  0  ia  valid  for  all  x  ^  -  nearly  everywhere), 


Lemma  3:  On  the  premise  defined  above, 

WO-'rO- 'xLjv6  ^ 

CfX.xy^.-iL.-kvC.A), 

x,  («,  V)  is  the  unbiased  estimate  function  for  V£v  . 
Proof:  Due  to  reproductivity, 

v(x;).  ~v(x). 
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Therefore,  E*(^(XL  IX))*E  *(  £  fa  l  I  X  ))  -  I  *f)  ’ 

=  £  (*th  vW'eW5'7^x1> 

= v  v^>  ¥ £&>1 4£^4('4) v{x)- 

-  . .,, 

V(xJ*)  w/x) 

Thus  — "  is  an  unbiased  estimate  function  for  v  (  *  J 

¥('-¥) 

(independent  of  £  ).  However,  since  the  distribution  of  X  is  complete 
for  every  value  of  A.  ,  the  unbiased  estimate  function  for  V  Q*) 

(  ^  -  nearly  everywhere)  is  clearly  defined.  If  we  designate  the 

unbiased  estimate  function  for  V(&)  by  V  ,  then 


^  -  nearly  everywhere) 


is  valid  and  proves  the  ratio  of  variances.  The  ratio  of  covariances 
follows  directly  from 

V(X;  +Xj  I*)  =  V(xL  1  ,)4  V(x}  l*)+2  C  C*iXjl>) 


Lemma  4:  /  k  \ 

k  '  \  '  k  _  ,  A; 

l  1 


The  proof  is  derived  directly  from  Lemmas  2  and  3. 
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8.  Specialisation  of  general  theorems  concerning  exponential 
distributions  for  reproductive  exponential  distributions. 


Although  notation  of  density  in  fora  (1)  and  incorporation  of 
any  factors  independent  of  6  in  the  measure  /*■  facilitated  general 
considerations,  it  is  more  feasible  in  application  to  let  the  measure 
remain  fixed.  This  is  particularly  valid  in  case  a  second  reproductive 
parameter  K.  exists  in  addition  to  6  .  In  this  event  we  take  the 
isure  as  a  basis,  which  is  independent  of  A,  and  with  respect 


to  which  P. 


has  density 


[?'*]{(*> K)  k 

In  this  case  the  conditional  distribution  of  X  ,  X  ,  X,  J  21  ~  9 

fc  l 

has  density  k  . 

rk  1  TJfOi^i) 

00) 

with  respect  to  measure/* O, ) */*  (*))**"  * /*  under  the 

conditions  listed  on  page  13.  (This  formula  replaces  Formula  7,  p.  9). 


(11) 


In  case  £*  0  ,  this  density  becomes 


for  all  t«  k  due  to  C°)  '  0 


Learns  2  permits  further  simplification  of  criteria  defined  in 
Theorems  4,  6  and  8  in  case  of  reproductivity,  and  allows  conversion 
to  even  more  feasible  forms  under  certain  conditions.  We  shall  dispense 
with  their  detailed  treatment  and  merely  list  a  few  examples: 


Theorem  4  (page  8)  proposes  as  criterion. 


In  case 


<r  *  o 


this  criterion  proves  equivalent  to 


(12)  Ku(*r 

due  to  E (x f  j a) 
are  consiaered. 


,  when  A  i  +  ^-2.  *  *  Af  *  Z 
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Furthermore,  we  may,  under  certain  conditions,  approximate  the 
criterion's  distribution  by  a  normal  distribution,  if  tha  null  hypothesis 
is  valid.  This  requires  standardization.  Upon  application  of  Lemma  4, 
standardization  of  criterion  (12)  leads  directly  to  form: 


(13) 


■V)  6X5 


Upon  standardization  of  the  criterion 


I 


listed  as 


corollary  to  Theorem  5  (page  8),  Lemma  4  immediately  supplies  the 
standardized  form  of  the  criterion:  . 

k  * 


(14) 


or 


2  c&i  -  eWk 

i _ 

V  V[A]v(*,A) 


where  f  [A]  *  £  A-  ^ . 

I  i 

Finally,  we  shall  specialize  Theorem  7  (page  12)  for  t  '  2  in  the 
caee  of  reproductive  distributions.  Ae  in  Chapter  5,  we  consider  that  the 
criterion  is  simplified  to: 

Ir'  f »  •  o)  ?  >! 

•-  V  It  *.r  '  /  / 


in  case  ^  ~  2  .in  case  of  reproductivity  we  get  the  criterion 


by  standardization  oased  on  Lemma  4. 


In  practice,  one  frequently  has  a  free  choice  of  parameters 
A.  ,  provided  certain  limitations  are  observed  (e.g., 

k1  1 

\\  <  A.  ^  ).  In  this  paper  we  shall  not  discuss  how  parameters 


about  &  . 


are  to  be  chosen  in  order  to  afford  precise  information 
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9.  Application  to  various  discrete  distributions. 


There  are  various  discrete  distributions  which  are  amenable  to 
conversion  to  au  exponential  fora  through  appropriate  transforation  of 
their  parameter,  and  are,  in  addition,  reproductive  with  respect  to  a 
second  parameter.  Specifically,  these  include: 


Blnoaial  distribution: 


Trane format ion 


Poisson  distribution: 


— 


~~l~"  yields 


This  notation  deviates  from  the  customary  style  in  that  the  mean 
value  has  been  entered  aa  the  product  of  two  parameters.  The  additional 
parameter  A,  proved  feasible  because  many  applications  present  a 
situation  in  which  the  mean  valuaof  distributions  are  related  in  certain 
known  proportions  (  b-i  )  ,  while  the  common  factor  of  proportionality 

(  Q-  )  is  unknow . 

The  transformation  6  * I ^  produces:  txp  [- 


Negative  binomial  distribution: 


r'C 


f)‘ 


The  transformation  ? 
Pascal  distribution 


loa,  D 
i 


•  (lT'j 

Lelds:  '  ) 

'  /}X 

The  transformation  ^  0~p)  yields:  ex  rP-Jh-i ). 

It  is  known  that  all  of  these  distributions  are  reproductive  with 
respect  to  parameter  A.  ,  making  them  accessible  to  Lenaas  2,  3  and  4* 

As  shown  by  an  elementary  computation,  the  function  v  (x ,  O  takes  the 
following  form  in  these  four  distributions: _  ' 

/  i\.  *  C^r~K) 

Binomial  distribution:  \  •  I 

Poisson  distribution:  *  * 


Negative  binomial  distribution:  V  (x  >  \) 
Pascal  distribution:  u  A,  A.)  ~  — 

v  7  A+l 


\) 


(*+*) 
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Basically,  we  can  now  explicitly  elaborate  any  one  of  Theorems  1-8 
(when  required,  with  due  consideration  of  Lemmas  2  and  U)  for  each  of 
the  specific  distributions  listed  above.  Instead,  we  chall  isolate 
certain  special  cases: 


1.  Propositions  about  the  difference  of  two  parameters. 


In  order  to  make  unilateral  propositions  about  the  difference  of 
two  parameters,  we  must,  according  to  Theorem  3  (page  8),  depart  from 


the  criterion  *,  and  consider  the  conditional  distribution  of 

*,IVV* 


According  to  Formula  10  (special  case  &  i  (6)* 

),  this  conditional  distribution  possesses  the  density: 


- JcTT) - ' 


where 


Specializing  this  general  formula  in  the  Poisson  distribution, 
we  get  density:  /  r  \%  i  >  „  S'  \*-* 


I 


I 


V 


tfc. 


This  is  the  density  of  a  binomial  distribution  with  parameter 
1 


D- 


Tn  mmnnt (  n rv  fimnH 


KS 


marr  f  kmvm/Ama  awws  I  «vr  ♦Ve 
ussj  wuv*  wav*  w  vtiipAwj  wuu 


tables  of  eummable  functions  of  the  binomial  distribution.  In  this 
manner  we  obtain  a  proposition  with  respect  to  <f  _ .  Even  more  clarity 
is 


about 


obtained  by  converting  this  proposition  about  f  into  a  proposition 
ut  ,  since  — —  because  9  *  I  rJ  • 


We  thus  obtain,  as  desired,  a  test,  a  confidence  interval  or  a 
median  unbiased  estimate  function  for  the  quotient  a j  .  The  test 

®-i  |  ^ 

for  hypothesis  -jr-  —  1  (i.«.,  a.  =  «•  -  )  was  first  defined  by 

Przyborowski  and  Wilenski  (12).  Its  optimality  was  proved  by  Tocher 
(13).  Bross  (2)  employed  these  considerations  in  the  construction  of 
a  confidence  interval. 
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Specialization  in  the  case  of  binomial  distribution' leads  to 
hyperge^metric  distribution  if  f =  0  .  We  thereby  get  R.A.  Fisher's 

(5)  teat  as  optimal  test  for  hypothesis  o-0  (i.e.,  P ,  ?x  ). 

Its  optimality  mss  aieo  demonstrated  by  Tocher  (13).  In  the  general 
case  we  get  a  test  of  an  arbitrary  value  of  i"*  (or  a  confidence 
interval  or  a  median  unbiased  estimate  of  S'  ).  If  we  convert  this 
proposition  about  €  into  a  proposition  about  the  original  parameter 


{*  i  P 2.  »  w®  8°i  a  proposition  about 


P* 

,0)TTt 


°3  TT 


because 


about 


,  or  by  antiiogarithmic  treatment,  a  proposition 


/ 


This  appears  unwelcome  at  first  glance,  since 


one  is  accustomed  to  thinking  primarily  in  values  of  P  and  a  proposition 
about  (  P,  -  )  or  />  would  be  mors  desirable.  Actually, 

neither  £(>  *  nor  p7~  a  plausible  measure  for  the  difference 


between  P,  and  Px  -  An  increase  from  to  2%  ie  not  the  same  as 
an  increase  from  100  to  Ujt  or  from  500  to  5Ut.  Nor  is  it  the  same  as 
an  increase  from  100  to  200  or  from  500  to  lOOjl.  On  this  basis,  holler 
(?,  page  324)  proposed  —  independently  of  the  present  considerations  — 

/  P,  \  \ 

the  value  /  — —  '  ~ -  i  as  measure  of  the  difference.  Moreover, 


the  transfo 


/  P,  x  \ 

I  1 — ~ —  >  ~ — - —  J  as  measure  of  the  difference.  Moreover, 
'ormatlon  B-  log  — — is  widely  accepted  in  connection  with 


the  evaluation  of  activity  curves.  It  follows  naturally  from  the 
aawuqjtion  that  the  effectiveness  of  a  drug  (i.e.,  the  portion  f*  of 
thoae  cases  in  which  the  affect  was  observed)  depends  on  the  dosage  in 
fora  of  a  logistic  function  (cf.,  for  example,  Berkson  1).  Finally, 
let  me  point  out  that  Weichselberger  concluded  in  (17)  that  the 
/  ?  >  .  Px  \ 

quantity  I  - — —  >  ^  “  J  is  an  adequate  measure,  in  certain  models, 
for  the  degree  of  dependence  in  a  2x2  contingency  table. 

2.  A  test  against  trend. 

If  Formula  (14)  (page  17)  for  A-I  »•  b  Is  specialized  in  the 
case  of  binomial  distribution,  the  result  is: 
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i L 
l 


k 


1  -/ 


ic  \- 

V[g-  Z  (- 1  _££0^3 


Essentially  the  same  criterion  is  obtained  by  specialization  of  the 
results  of  Yates  (I*,,  p.  178)  in  the  case  of  a  2xk  table.  (The  sole 
difference  consist?  in  the  fact  that  Yates  employs  A,  in  the  denominator 
in  place  of  \—f  ).  Yates'  formula  was  adopted  by  Cochran  (3,  p.  435, 
Formula  I4),  where  the  reader  finds  practical  examples. 


Incidentally,  this  teat  shows  a  certain  kinship  with  the  test  of 
v.  Eeden  and  Hamelrijk  (4),  although  it  is  by  no  means  equivalent  +0  the 
latter.  Using  our  designations,  the  criterion  employed  by  v.  Eeden  and 
Hemelrijk  is:  l  1,1  . 

1  _ 


Aside  from  employment  of  the  unweignted  mean  k  * 1  in  place  of 

r  i  — 

V  a. 


criterion  on 


,  an  essential  difference  involves  construction  of  the 

*  ■ 

L 

— -  instead  of  on  *; 


T  *  l - - .).l  ....  )  <  ,  | 

-Lit  Ulic  1 

accept  the  values  * •  *  0  and  x - 

t 


z 


c  x. 
c 


which  follows  for  *  c 


^  ^  nnl  w 

*  *  1  t  ’■“**  w 

.  Consequently  the  criterion 
from  the  corollary  to 


Theorem  5  (page  3)  becomes  identical  with  the  sum  of  those  rank 
numbers  c  for  wnich  *■  i  .  This  criterion  of  rank  suns  was  used 
by  Haldane  and  Smith  (6).  It  proved  to  be  equivalent  to  Wilcoxon's 
(Mann-Whitney)  test,  if  the  quantity  [l:  *1  =  I  j  is  considered  the 
rank  numbers  of  one  sample  and  the  quantity^  1  .  v-  s  Q  1  as  the 
rank  numbers  of  the  second  sample.  1  c  J 
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i ,  The  combination  of  2x2  contingency  tables. 


Specialization  of  Formula  (15)  (p»g®  1")  CAB®  °*  binomial 

distribution  yields:  . 

4-~Tr~V-j  *,)) 


^~XT~V'j  *,)) 

-\fy 

Vv  ^  l-r' 


Since  the  test  for  independence  in  a  2x2  contingency  tahle  can  also 
be  reduced  to  a  comparison  of  two  binomial  distributions  (cf.,  for  e«^le, 
Lehmann  9,  page  143),  we  may  apply  this  formula  to  the  combination  of 
2x2  contingency  tables.  It  will  be  optimal  if  an  existing  dependence 
has  the  same  direction  in  every  contingency  table  and  corresponds  in 
quantity  to  a  constant  logit  effect,  i.e., 

_ J.  P’J  _  for  all  j*  >,*,—  r  . 

I-  paj  I'flj 

The  custom  of  eimply  adding  the  X2  values  of  individual  contingency 
tables  is  unfeasible  in  this  case  because  the  direction  of  dependence  is 
not  being  considered,  ana  the  consistency  of  dependence  in  the  various 
contingency  tables  does  not  come  into  play. 

A  somewhat  refined  criterion,  „  , 


7*-  f  i  /2Z2Sl2X 


I  a.,j 

which  is  based  on  Formula  (13)  and  also  finds  frequent  employment, 
differs  only  in  weighting. 

Criterion  (16)  was  described  by  Cochran  (3,  p.  444).  The  formula 
is  not  strictly  substantiated  by  him,  although  his  princip^a  of 
optimization  (3,  page  448-450)  is  essentially  Identical  with  ours. 

Relevant  examples  of  the  application  of  this  fonnula  are  found  in 

A  /  o  uni  /  am  veil  a«  in  Paapsoh  l  TO)  Slid  I&t88  \15»  i 

uocnran  \j9  p.  **■  —  •  •  *  ,  ..  » . 

the  two  authora  mentioned  last  treat  these  examples  by  another  method, 
however. 
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